Representative Benchmark Suites for Barrier Heights of
Diverse Reaction Types and Assessment of Electronic
Structure Methods for Thermochemical Kinetics

Jingjing  Zheng,  Yan  Zhao,  and  Donald  G.  Truhlar* 
Department  of  Chemistry  and  Supercomputing  Institute, 
University of Minnesota, Minneapolis, Minnesota 55455-0431

Prepared  for  J.  Chem.  Theory  Comput. 

December  19,  2006 


*  Corresponding  author,  e-mail:  truhlar@umn.edu 


Approved  for  public  release;  distribution  unlimited 



Abstract 


We propose three small sets of barrier heights for heavy-atom transfer, nucleophilic
substitution, and unimolecular and association reactions as benchmarks for comparing
and developing theoretical methods. We choose the data sets to be statistically representative
subsets of the NHTBH38/04 database. Each data set consists of 6 barrier heights;
we call these small benchmark suites HATBH6, NSBH6, and UABH6. Benchmark values
are tabulated for 205 combinations of theory level and basis set. The theory levels
studied include single-level wave function theory such as Hartree-Fock, Møller-Plesset
perturbation theory, quadratic configuration interaction, and coupled cluster theory; they also
include multicoefficient correlation methods, local and hybrid density functional theory,
and semiempirical molecular orbital methods. The three new representative data sets
are combined with a previous representative data set for hydrogen-transfer reactions to
form a new compact but diverse and representative data set called DBH24. Comparison
of a large number of methods for their performance on DBH24 leads us to recommend
the following methods for barrier height calculations, in order of decreasing cost: G3SX,
BMC-CCSD, PWB6K, BB1K, M06-L, MPW1K, HF/MIDI!, and PM3.

1 Introduction


Data  sets  composed  of  experimental  and  high-level  electronic  structure  results  are  very  useful  for 
assessing  the  performance  of  new  theoretical  methods.  Examples  of  such  data  sets  are  G2/97,1-3 
G3/99,2-4  Database/3,5  and  NHTBH38/04.6  These  databases  usually  contain  a  large  number 
of  data  to  ensure  diversity.  But  a  disadvantage  of  using  them  to  test  new  methods  is  that  it 
requires  a  burdensomely  large  number  of  calculations,  which  is  not  always  affordable.  Hence, 
it  is  useful  to  develop  a  smaller  set  of  data  that  is  representative  of  the  larger  database  and  can 
be  used  more  conveniently  as  a  benchmark.  With  this  motivation,  our  group  has  developed 
small representative benchmark suites7 for the atomization energies and hydrogen-transfer
(HT)  reactions  in  Database/3.  These  benchmark  suites  have  been  widely  used  because  of  their 
efficiency.  In  this  paper  we  develop  new  small  benchmark  suites  for  non-hydrogen-transfer 
(non-HT)  reactions. 

The non-HT database NHTBH38/04 contains 38 barrier heights (BH) for heavy-atom transfer
(HAT), nucleophilic substitution (NS), and unimolecular and association (UA) reactions.
Because it can be expensive to use this suite, we present here a subset of data that has
statistical errors close to those exhibited by the entire set; in particular, the small set adequately
reproduces the mean signed error (MSE), mean unsigned error (MUE), and root-mean-square
error (RMSE) of the full set.
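
The three error measures named above can be written down compactly. The following is a minimal sketch, assuming the usual definitions (signed error taken as calculated minus reference); the function name and the numerical values are hypothetical and are not taken from the database.

```python
import math

def error_stats(calculated, reference):
    """Return (MSE, MUE, RMSE) for calculated vs. reference barrier heights.

    Assumes the signed error is calculated minus reference, so a positive
    MSE means the method overestimates barrier heights on average.
    """
    errors = [c - r for c, r in zip(calculated, reference)]
    n = len(errors)
    mse = sum(errors) / n                             # mean signed error
    mue = sum(abs(e) for e in errors) / n             # mean unsigned error
    rmse = math.sqrt(sum(e * e for e in errors) / n)  # root-mean-square error
    return mse, mue, rmse

# Hypothetical barrier heights in kcal/mol, for illustration only.
reference = [10.0, 20.0, 30.0]
calculated = [11.0, 18.0, 30.0]
mse, mue, rmse = error_stats(calculated, reference)
```

Because the MSE keeps the sign of each error, it can be small even when the MUE and RMSE are large, which is why all three measures are compared between the subset and the full set.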

A modern theorist has many quantum chemistry methods available for calculating thermochemical
kinetics. In general, high-level ab initio wave function theory (WFT) methods can
give very accurate results, but they are very time-consuming. Semiempirical molecular orbital
methods are computationally inexpensive, but less accurate and less reliable. Density-functional
theory (DFT) has proven to be very efficient with an excellent performance-to-cost ratio.
Multilevel methods can also achieve high accuracy with low computational cost by extrapolating
the results of some low-level methods. In the present work a wide variety of methods covering
various kinds of theory and basis sets has been tested against barrier heights for both the full
data sets and the representative subsets for both non-HT and HT reactions. These methods are
assessed according to their mean errors for calculating barrier heights and their computational
scaling and costs in order to allow one to choose the theory level and basis set according to the
required performance-to-cost ratio for applications of thermochemical kinetics. In addition, the
mean errors presented here can be used to estimate the reliability limits of various methods of
calculation.

Section 2 describes the methods and the database used in the present work, Section 3 describes
the  strategies  for  selecting  a  representative  subset,  Section  4  gives  the  results  and  discussion, 
and  Section  5  gives  our  conclusions. 

2  Methods  and  Databases 

2.1  Theoretical  Methods 

In  order  to  develop  the  new  test  sets,  we  used  205  methods  (where  a  method  is  a  combination 
of  theory  level  and  basis  set)  to  calculate  the  barrier  heights  in  the  NHTBH38/046  database. 
For  DFT,  theory  “level”  means  choice  of  density  functional. 

The single-level methods include local DFT (BB95,8 BLYP,9,10 BP86,9,11 G96LYP,10,12,13
HCTH,14 M06-L,15 mPWKCIS,16-21 mPWLYP,10,16 mPWPW,16 OLYP,10,22 PBE,23 PBE1W,23,24
PW91,25-29 TPSS,20,21 TPSSKCIS,17-21 SPL,30 SPWL,31,32 VSXC,33 and τHCTH34), hybrid
DFT (B1B95,8,9 B1LYP,8,36 B3LYP,8,35,37 B3LYP*,38 B97-1,39 B97-2,39 B97-3,40 B98,41 BB1K,42
BHandHLYP,9,10,35 BMK,43 HFLYP,44 M05,45 M05-2X,45 M06-HF,46 MPW1B95,47 MPW1KCIS,6,8,17-19
mPW1PW,16 MPW1K,16,17,48 MPW3LYP,10,16,47 MPWB1K,47 MPWKCIS1K,6,8,17-19 O3LYP,10,22,49
PBE1KCIS,17-19,23,50 PBE1PBE,23 PW6B95,51 PWB6K,51 TPSS1KCIS,17-21,52 TPSSh,20,21 X3LYP,9,10,26,53
and τHCTHh34), and single-level WFT (Hartree-Fock (HF), Møller-Plesset second-, third-, and
fourth-order perturbation theory (MP2,54 MP2(full), MP3,55 MP455), Møller-Plesset fourth-order
perturbation theory without triple excitations55 (MP4SDQ), coupled cluster theory (CCSD,56
CCSD(T)57), and quadratic configuration interaction (QCISD58 and QCISD(T)58)). Note that
the local DFT methods include strictly local ones (SPWL and SPL), GGAs (BP86, BLYP,
HCTH, G96LYP, mPWLYP, mPWPW, OLYP, PBE, PBE1W, and PW91), and meta GGAs
(BB95, M06-L, mPWKCIS, TPSS, TPSSKCIS, VSXC, and τHCTH), where GGA denotes
generalized gradient approximation. In single-level WFT methods, core electrons are uncorrelated
(i.e., doubly occupied in all configurations, sometimes called frozen) except for MP2(full)
calculations.

We also tested some semiempirical molecular orbital methods such as AM1,59 PM3,60
PDDG/PM3,62 PM6,61 RM1,63 and SCC-DFTB.64 The first five of these methods are based
on neglect of diatomic differential overlap (NDDO), whereas SCC-DFTB is a
self-consistent-field nonorthogonal tight-binding method.

In addition to the above single-level methods, we also used WFT-based multilevel methods
(CBS-4M,65,66 CBS-Q,65 CBS-q,67 CBS-QB3,66,68 G3SX,74 G3SX(MP3),74 MCCM/3,81
BMC-CCSD,69 MLSEn+d (n = 1-4),78 and the scaling all correlation method79-81 (SAC-MP2 including
SAC/3)). Within the MCCM/3 suite, we considered MCG3/3, MC-QCISD/3, MC-UT/3, and
MC-CO/3. Finally we studied multicoefficient extrapolated DFT methods (MC3BB,82 MC3MPW,82
MC3MPWB,52 MC3TS,52 MCG3-MPW, -MPWB and -TS,52 MCCO-MPW, -MPWB and
-TS,52 MCUT-MPW, -MPWB and -TS,52 MCQCISD-MPW, and MCQCISD-MPWB and
-TS52). Note that multicoefficient extrapolated DFT methods are examples of fifth-rung83 DFT.

2.2  Basis  Sets 

The theoretical levels are combined with one or more of the following basis sets: 6-31B(d),69
6-31+B(d,p),69 6-31G(d),55 6-31G(d,p),55 6-31G(2df,p),55 6-311G(2d,p),55 6-311G(2df,2p),55
6-31+G(d),55 6-31+G(d,p),55 6-31+G(d,2p),55 6-31+G(2df,p),55 6-31++G(d,p),55 6-31++G(2df,2pd),55
6-311++G(d,p),55 6-311++G(2df,2pd),55 aug-cc-pV(D+d)Z,70 aug-cc-pVTZ,71 aug-pc1,72
aug-pc2,72 cc-pV(D+d)Z,70 cc-pV(T+d)Z,70 cc-pVTZ,70 G3Large,73 G3XLarge,74 MG3,75,76 MG3S,77
MG3T,77 MIDI!,84 MIDIX+,84,85 and MIDIY.84,85 Note that 6-31+G(d,p) is abbreviated to
DIDZ in some of our previous papers. We also note that MG3 is also known as G3LargeMP2;
it is identical to 6-311++G(3d2f,2df,2p) for H through Si and very similar to 6-311+G(3d2f)
for P, S, and Cl. The MG3S basis set is identical to MG3 except that diffuse functions on
hydrogen have been removed. The MG3T basis set removes all diffuse functions from the MG3
basis set.

2.3  Geometries,  Spin-Orbit  Coupling,  and  Software 

All  calculations  in  the  present  work  used  structures  optimized  at  the  QCISD/MG3  level  with 
the  spin-restricted  formalism  for  closed-shell  and  the  spin-unrestricted  formalism  for  open-shell 
systems. The effect of spin-orbit coupling was added to the energies of the Cl, F, and OH
radicals, which lowers their energies by 0.84, 0.38, and 0.20 kcal/mol, respectively.80

All single-level and CBS electronic structure calculations in this work were performed using
the Gaussian03 package,86 except that the B97-3, M05, M05-2X, M06-HF, M06-L, PW6B95, and
PWB6K calculations were carried out with a locally modified version of Gaussian03. Four of the
semiempirical molecular orbital methods (AM1, PM3, PDDG/PM3, and RM1) were calculated with the
MOPAC 5.012mn87 program, which is available from the Truhlar group’s web page;88 the PM6 method
was calculated with the MOPAC 7.261 program; SCC-DFTB calculations were performed with the
DFTB64,89 program; and single-level CCSD(T) calculations were carried out with the MOLPRO90
program. All multilevel methods except the CBS ones were carried out with the MLGAUSS91
program in conjunction with Gaussian03, except that the MLSE multilevel methods were
calculated with Gaussian03 and a Fortran code. For all single-point energies except CCSD(T),
we used the spin-restricted formalism for closed-shell calculations and the spin-unrestricted
formalism for open-shell calculations. For CCSD(T), we used a spin-restricted calculation to
obtain the orbitals, but a spin-unrestricted correlated calculation.

2.4  NHTBH38/04  Database 

The NHTBH38/046 database for non-hydrogen-transfer reactions consists of 6 heavy-atom
transfer reactions, 4 bimolecular nucleophilic substitution reactions, 4 unimolecular nucleophilic
substitution reactions of ion-dipole complexes that produce product ion-dipole complexes, and
5 non-nucleophilic-substitution unimolecular or association reactions. This database is repeated
in Supporting Information. The details of the database are presented in Ref. 6. As explained
in more detail in Ref. 6, three of the 19 forward barrier heights are semi-experimental (meaning
that they are derived by correcting calculated rate constants to agree with experimentally
observed ones), and the other 16 are based on Weizmann-1,6,92 and Weizmann-292,93 calculations.
The reverse barrier heights are obtained from the forward barrier height and the energy of
reaction. All barrier heights, in both directions, are zero-point-exclusive.
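
The relation between forward and reverse barrier heights can be illustrated with a minimal sketch. It assumes the common sign convention that the energy of reaction is the product energy minus the reactant energy (both zero-point-exclusive); the function name and the numbers are hypothetical and are not database values.

```python
def reverse_barrier(forward_barrier, reaction_energy):
    """Reverse barrier height from the forward barrier and the energy of reaction.

    Assumes reaction_energy = E(products) - E(reactants), so an exoergic
    reaction (negative reaction_energy) has a larger reverse barrier.
    All quantities are in the same units (here kcal/mol).
    """
    return forward_barrier - reaction_energy

# Hypothetical example: a 12.0 kcal/mol forward barrier for a reaction
# that is exoergic by 5.0 kcal/mol.
v_forward = 12.0
delta_e = -5.0
v_reverse = reverse_barrier(v_forward, delta_e)
```

This is why each reaction contributes two barrier heights to the database: 19 reactions give 38 barrier heights.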


3  Strategies  for  Selecting  a  Representative  Subset 

Since the NHTBH38/04 database contains three different types of reactions, we should select a
subset that can represent all the reaction types, i.e., the reactions of the subset should not come
from only one or two of these types. Here, we used two strategies to select the representative
subset.

3.1  Strategy  A 

In this strategy, we selected a small subset that represents the whole database with the
minimum root-mean-square deviation (RMSD) between three error measures, mean signed error,
mean unsigned error, and root-mean-square error (MSE, MUE, and RMSE), calculated using
NHTBH38/04 and the same error measures calculated using the subset (SS), e.g., the
deviation between the MSE using NHTBH38/04 (MSE(NH38)) and the MSE using the small subset
(MSE(SS)). In this strategy, all the subsets must contain at least one reaction from each
reaction type. This restriction guarantees that the representative subset is meaningful for the whole
database. The RMSD is calculated by using Eq. 1.

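$$
\mathrm{RMSD} = \left[ \frac{1}{3n} \sum_{i=1}^{n} \left[ \left( \mathrm{MSE}_i(\mathrm{NH38}) - \mathrm{MSE}_i(\mathrm{SS}) \right)^2 + \left( \mathrm{MUE}_i(\mathrm{NH38}) - \mathrm{MUE}_i(\mathrm{SS}) \right)^2 + \left( \mathrm{RMSE}_i(\mathrm{NH38}) - \mathrm{RMSE}_i(\mathrm{SS}) \right)^2 \right] \right]^{1/2} \tag{1}
$$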


where n is the number of methods used to generate representative test sets; in particular,
n = 154. The mean error (ME) is then defined by

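$$
\mathrm{ME} = \frac{1}{3n} \sum_{i=1}^{n} \left( \left| \mathrm{MSE}_i(\mathrm{NH38}) \right| + \mathrm{MUE}_i(\mathrm{NH38}) + \mathrm{RMSE}_i(\mathrm{NH38}) \right) \tag{2}
$$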

which is calculated only once for the whole database. Finally, we calculated the percentage error
in representation (PEIR) using


$$
\mathrm{PEIR} = 100\% \times \frac{\mathrm{RMSD}}{\mathrm{ME}} \tag{3}
$$
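
As an illustration of how Eqs. 1-3 fit together, the following minimal sketch evaluates RMSD, ME, and PEIR from per-method error measures. It assumes the same 1/(3n) normalization in Eq. 1 as in Eq. 2; the function names and the input values are hypothetical and are not results from this work.

```python
import math

def rmsd(full, subset):
    """Eq. 1. full and subset are lists of (MSE, MUE, RMSE), one tuple per method."""
    n = len(full)
    total = 0.0
    for (mse_f, mue_f, rmse_f), (mse_s, mue_s, rmse_s) in zip(full, subset):
        total += (mse_f - mse_s) ** 2 + (mue_f - mue_s) ** 2 + (rmse_f - rmse_s) ** 2
    return math.sqrt(total / (3 * n))

def mean_error(full):
    """Eq. 2. Uses only the full-database error measures."""
    n = len(full)
    return sum(abs(mse) + mue + rmse for mse, mue, rmse in full) / (3 * n)

def peir(full, subset):
    """Eq. 3. Percentage error in representation."""
    return 100.0 * rmsd(full, subset) / mean_error(full)

# Hypothetical error measures (kcal/mol) for two methods.
full_db = [(-1.0, 2.0, 3.0), (0.5, 1.0, 1.5)]
small_set = [(-2.0, 3.0, 4.0), (1.5, 2.0, 2.5)]
value = peir(full_db, small_set)
```

Dividing by ME makes PEIR a relative measure: a subset that reproduces every error measure exactly gives 0%, independent of how accurate the methods themselves are.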


Note that although the present study includes 204 methods, we used n = 154 in the above
statistical analysis: we excluded the six semiempirical molecular orbital methods at
this stage because they increase PEIR significantly, and the other 44 methods were applied
only to the representative subsets.

This choice of 154 methods yields ME = 5.26 kcal/mol. The values of RMSD and PEIR were
calculated for each possible subset of reactions in the NHTBH38/04 database, and the representative
subset for a given number of reactions is the one with the smallest PEIR.
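
The exhaustive search of strategy A can be sketched as follows. This is a minimal illustration, not the code used in this work: it assumes that each reaction contributes a forward and a reverse signed error per method, that Eq. 1 uses a 1/(3n) normalization, and that ties are broken by enumeration order. All names and numbers are hypothetical.

```python
import math
from itertools import combinations

def stats(errs):
    """(MSE, MUE, RMSE) of a list of signed errors."""
    n = len(errs)
    return (sum(errs) / n,
            sum(abs(e) for e in errs) / n,
            math.sqrt(sum(e * e for e in errs) / n))

def peir(errors, subset):
    """errors[m][r] = (forward_error, reverse_error) for method m and reaction r."""
    n = len(errors)
    sq, me = 0.0, 0.0
    for per_reaction in errors:
        full = stats([e for pair in per_reaction for e in pair])
        sub = stats([e for r in subset for e in per_reaction[r]])
        sq += sum((f - s) ** 2 for f, s in zip(full, sub))
        me += abs(full[0]) + full[1] + full[2]
    return 100.0 * math.sqrt(sq / (3 * n)) / (me / (3 * n))

def best_subset(errors, types, size):
    """Lowest-PEIR subset of the given size with every reaction type present.

    Returns (PEIR, tuple of reaction indices), or None if no subset qualifies.
    """
    all_types = set(types)
    best = None
    for subset in combinations(range(len(types)), size):
        if {types[r] for r in subset} != all_types:
            continue  # strategy A: at least one reaction from each type
        score = peir(errors, subset)
        if best is None or score < best[0]:
            best = (score, subset)
    return best

# Hypothetical signed errors (kcal/mol): 2 methods, 6 reactions.
types = ["HAT", "HAT", "NS", "NS", "UA", "UA"]
errors = [
    [(1.0, 2.0), (1.5, 1.5), (-3.0, -1.0), (-2.0, -2.0), (0.5, -0.5), (4.0, 3.0)],
    [(-1.0, 0.0), (-0.5, -0.5), (2.0, 1.0), (1.5, 1.5), (0.0, 1.0), (-2.0, -3.0)],
]
result = best_subset(errors, types, 3)
```

For a 19-reaction database the number of candidate subsets of up to 9 reactions is in the hundreds of thousands, so a brute-force enumeration of this kind is inexpensive once the per-method errors have been tabulated.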


3.2  Strategy  B 

An alternative strategy for generating a representative subset is to consider the three types of
reactions (HAT, NS, and UA) as independent data sets and calculate the corresponding subset’s
RMSD and PEIR for each reaction type (HAT, NS, and UA) by using equations analogous to
Eqs. 1, 2, and 3, respectively, and then select the most representative subset for each reaction
type.

4 Results and Discussion


4.1 The Large Non-Hydrogen-Transfer Database

Table  1  contains  mean  signed  and  unsigned  errors  for  NHTBH38/04  for  160  methods. 

4.2  Representative  Databases 

In order to select a representative subset of the NHTBH38/04 database, at least three reactions
are required, which represent the three reaction types in the database. Hence, we calculated
a series of the most representative subsets consisting of 3 to 9 reactions according to strategy
A (see Fig. 1(a)). The 9-reaction representative subset gives the lowest PEIR, namely 12.8%,
and it consists of 3 reactions for each reaction type. Encouragingly, strategy B also selected the
same reactions for the representative subset, so the results of strategies A and B are consistent
with each other. The 3-reaction subsets for HAT, NS, and UA reactions according to strategy
B have similar PEIRs, which are 12.4%, 9.2%, and 11.1%, respectively (see Fig. 1(b)). We call
these representative subsets HATBH6, NSBH6, and UABH6, respectively. In a previous paper,7
we identified a representative data set for hydrogen-transfer reactions, namely BH6. We
label this BH6 database more explicitly as HTBH6 in the present paper. The combined
set of these four representative databases is called DBH24, where D denotes diverse, and it is
given in Table 2. As explained more fully in Refs. 5 and 6, four of the 12 forward barrier heights
in DBH24 are semi-experimental (defined in Section 2.4), six are from Weizmann-1 calculations,
and two are from Weizmann-2 calculations. The reverse barrier heights are obtained from the
forward barrier height and the energy of reaction. All barrier heights in the database, in both
directions, are zero-point-exclusive.

Figure 1: Percentage error in representation vs. number of reactions: (a) strategy A; (b) strategy B.


The HATBH6 data set is based on three reactions involving nitrogen, fluorine, and chlorine
transfer. It represents the six heavy-atom transfer reactions in the NHTBH38/04 database not only
by having the smallest PEIR but also by covering all the heavy-atom transfer types in the whole
database. NSBH6 is also appealing in that not only is it statistically representative, but it also
involves all three anions, OH−, F−, and Cl−, that appear in NHTBH38/04. UABH6 includes
two open-shell systems and one closed-shell system. Fortunately, the most time-consuming
reaction, CH3 + C2H4 → CH3CH2CH2, is not involved in this subset.

The subsets of non-hydrogen-transfer and hydrogen-transfer barrier heights yield errors that
are representative of their respective errors using NHTBH38/04 and Database/3, respectively.
Table 3 gives the errors calculated using the HATBH6, NSBH6, and UABH6 subsets. The size
of the representative subsets is less than half of the whole NHTBH38/04, and using these
subsets can significantly reduce the computational cost for testing and developing new theoretical
methods.




4.3  Assessment  of  Methods 


We have tested 160 methods against the full NHTBH38/04 database and 153 methods against
both the non-hydrogen-transfer and hydrogen-transfer representative barrier height databases.
The mean errors (MSE and MUE) of these methods are given in Table 1 for the full
non-hydrogen-transfer data set and in Table 3 for both the non-hydrogen-transfer and hydrogen-transfer
representative subsets. We also tabulated a value for MMUE (mean MUE), defined as

$$
\mathrm{MMUE} = \frac{1}{3} \left[ \mathrm{MUE(HAT)} + \mathrm{MUE(NS)} + \mathrm{MUE(UA)} \right] \tag{4}
$$

in Table 1 or

$$
\mathrm{MMUE} = \frac{1}{4} \left[ \mathrm{MUE(HAT)} + \mathrm{MUE(NS)} + \mathrm{MUE(UA)} + \mathrm{MUE(HT)} \right] \tag{5}
$$

in Table 3. In the two tables, all methods are sorted by MMUE and the scaling order α, where
a method’s computational cost, in the limit of a large number N of atoms, scales94 as N^α.
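
As a check on the three-term average of Eq. 4, the following minimal sketch recomputes MMUE from the HAT, NS, and UA MUEs listed in Table 1 for three of the N^7 methods. The data structure and the sort key are illustrative assumptions about how the tables are ordered (by scaling group, then by MMUE within a group).

```python
def mmue(mues):
    """Mean of the per-reaction-type mean unsigned errors (Eq. 4 or Eq. 5)."""
    return sum(mues) / len(mues)

# name: (scaling exponent, [MUE(HAT), MUE(NS), MUE(UA)]) in kcal/mol,
# with the MUEs copied from Table 1.
methods = {
    "QCISD(T)/MG3S": (7, [1.21, 1.08, 0.53]),
    "G3SX": (7, [0.74, 0.50, 0.56]),
    "CCSD(T)/aug-cc-pVTZ": (7, [0.71, 0.71, 0.41]),
}

# Assumed ordering: higher scaling exponent first, then increasing MMUE.
ranked = sorted(methods, key=lambda name: (-methods[name][0], mmue(methods[name][1])))
```

The recomputed averages round to 0.60, 0.61, and 0.94 kcal/mol, in agreement with the MMUE column of Table 1 for these three methods.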

The computational costs in Tables 1 and 3 are the computer time for calculating an energy
gradient of the molecule phosphinomethanol divided by the time for an MP2/6-31+G(d,p)
energy gradient calculation with the same computer program on the same computer. For
calculations where the energy was calculated with Gaussian03 or MOLPRO we calculated the
MP2/6-31+G(d,p) gradient with the same program. For semiempirical molecular orbital methods
calculated with MOPAC or DFTB, we take the ratio to the MP2/6-31+G(d,p) computer
time with Gaussian03 on the same computer. We used a gradient calculation to illustrate
the cost because gradients are important for geometry optimization and dynamics calculations.
When analytic gradients are available in the computer programs specified in Section 2.3, we
used them. Otherwise, we used numerical gradients. In Gaussian03 the cost for a numerical
gradient of phosphinomethanol is 49 times the cost of a single-point energy, whereas in
MOLPRO it is 19 times the cost of a single-point energy. For local DFT methods, we give
two costs corresponding to carrying out the calculation with and without density fitting. The
density fitting method95,96 (sometimes called resolution of the identity) employs auxiliary basis
functions to represent the electron density; in this way, the four-center two-electron repulsion
integrals are decomposed into three- and two-center integrals, reducing the formal scaling from
N^4 to N^3.
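
The factors of 49 and 19 quoted above for numerical gradients are consistent with simple finite-difference counting for an eight-atom molecule such as phosphinomethanol (H2PCH2OH). The sketch below shows one plausible accounting; the text does not state how either program forms its numerical gradients, so the two differencing schemes and the function names are assumptions made for illustration.

```python
def central_difference_cartesian(n_atoms):
    """Energy evaluations for central differences in all 3N Cartesian
    coordinates, counting the undisplaced reference geometry as well."""
    return 2 * 3 * n_atoms + 1

def forward_difference_internal(n_atoms):
    """Energy evaluations for forward differences in the 3N - 6 internal
    coordinates of a nonlinear molecule, plus the reference geometry."""
    return (3 * n_atoms - 6) + 1

N_ATOMS = 8  # H2P-CH2-OH: P, C, O, and five H atoms

cartesian_count = central_difference_cartesian(N_ATOMS)
internal_count = forward_difference_internal(N_ATOMS)
```

Under these assumptions the counts reproduce the quoted ratios, which illustrates why the cost of a method without analytic gradients grows with an extra factor proportional to the number of atoms.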

4.3.1 Non-Hydrogen-Transfer Reactions

Among  the  tested  multilevel  methods,  G3SX  gives  the  lowest  mean  errors.  The  MMUEs  of 
G3SX(MP3)  and  BMC-CCSD  are  identical  and  a  little  bit  higher  than  that  of  G3SX,  and  the 
MMUE  of  BMC-CCSD  is  only  13%  higher  than  that  of  G3SX(MP3).  But  the  computational 
cost  of  BMC-CCSD  is  about  6  times  smaller  than  that  of  G3SX(MP3)  and  9  times  smaller 
than  that  of  G3SX.  The  MLSEn+d  methods  give  good  performance  for  neutral  systems,  but 
are  not  as  good  for  general  charged  systems  since  they  lack  diffuse  functions  in  their  basis  sets. 

Among  the  ab  initio  WFT  methods  in  Table  1,  CCSD(T)/aug-cc-pVTZ  gives  a  very  high 
accuracy  with  0.61  kcal/mol  MMUE,  and  underestimates  non-hydrogen-transfer  BHs  slightly. 
But  it  is  very  time-consuming.  The  QCISD(T)/MG3S  method  has  a  slightly  higher  MMUE 
for non-hydrogen-transfer reactions but with smaller cost than the CCSD(T)/aug-cc-pVTZ
method.  MP4,  MP4SDQ,  MP3,  MP2,  and  HF  systematically  overestimate  the  BHs  of  HAT 
and  UA  reactions  as  shown  by  their  high  positive  MSE. 

Among the hybrid DFT methods in Table 1, BMK, PWB6K, BB1K, and MPWB1K are the
best performing methods with regard to their MMUE, and they give even higher accuracy than
some multilevel methods for calculating BHs of non-HT reactions. M05 gives the lowest MUE
for NS reactions, and B1B95 gives the lowest MUE for UA reactions. The B3LYP* functional
improves upon the B3LYP functional (see Table 3) by reducing the HF exchange energy from 20%
to 15%.

Among  the  local  DFT  methods  in  Table  1,  our  recently  developed  M06-L  functional  shows 
very good performance even with a double-ζ basis set. All the local DFT methods systematically
underestimate  the  BHs  of  non-HT  reactions  as  shown  by  their  negative  MSE,  except  that  M06-L 
overestimates  barrier  heights  of  UA  reactions  with  a  small  positive  MSE. 

The five NDDO semiempirical methods tested, AM1, PM3, PM6, RM1, and PDDG/PM3,
all underestimate the BHs of HAT reactions and significantly overestimate the BHs of NS and
UA reactions. But they are faster than MP2/6-31+G(d,p) by about five orders of magnitude
for phosphinomethanol gradients and by even more for larger systems.

4.3.2  The  Diverse  DBH24  Database 

For 108 of the 160 methods in Table 1 plus 44 new methods, we list mean unsigned errors for
DBH24 in Table 3. The mean errors of HATBH6, NSBH6, and UABH6 shown in Table 3 are
consistent with those of the full data set shown in Table 1. This illustrates the representative
quality of these subsets. Table 3 also gives the mean errors of hydrogen-transfer reactions with
the HTBH6 data set. It shows that the quality of the tested methods for
calculating hydrogen-transfer barrier heights correlates well with their quality for calculating
non-hydrogen-transfer barrier heights. Mean errors of non-hydrogen-transfer representative
data sets for the methods shown in Table 1 but not in Table 3 are given in Table S2 of
Supporting Information.

In Table 3 we compare several basis sets for DFT methods. We found that the MG3S basis
set is both less expensive and more accurate than the aug-pc2 basis set for DFT methods,
although aug-pc2 was specially designed72 for use with DFT methods. The aug-pc1 basis set
is 16 times less expensive than aug-pc2, but its MMUE is higher than that of aug-pc2 by only
about 0.6 kcal/mol (2.00 vs. 1.43 kcal/mol). Hence aug-pc1 has a better performance-to-cost
ratio than aug-pc2. But it is still less accurate than the 6-31+G(d,p) basis set, at least for the
MPWB1K functional. We also tested the 6-31+G(d,p) basis set with several other functionals,
and we found that it is competitive with MG3S because, although its MMUEs are slightly
higher than those of MG3S, its calculations are about 7-10 times faster. But 6-31G(d,p)
gives much higher MMUEs, in particular for NS reactions, due to its lack of diffuse functions,
a subject we have discussed previously.77 According to our calculations, MG3S gives the best
performance for barrier heights among the tested basis sets for DFT methods.

Since the SCC-DFTB method lacks parameters for halogens, we only calculated barrier
heights for UA reactions, HTBH6 reactions, and two HAT reactions (H + N2O → OH + N2 and
its reverse) with this method. SCC-DFTB gives better results than the NDDO semiempirical
molecular orbital methods for UA reactions with a 9.78 kcal/mol MUE, and it overestimates
the barrier heights for UA reactions on average by only 1.00 kcal/mol. But the MSE for the full
set of UA reaction barrier heights (see Table 1) becomes -3.47 kcal/mol. The MSEs of the full data
set and the representative subset are not consistent with each other because DBH24 does not have
the representative quality for semiempirical methods, since these methods were not used to
select the representative subset, as we discussed in Section 3.1. Nevertheless the small test sets
are instructive even for semiempirical methods. The SCC-DFTB MUEs for hydrogen-transfer
barrier heights and for the one heavy-atom-transfer reaction are 30.53 and 23.02 kcal/mol,
respectively, which are much larger than those of the NDDO semiempirical molecular orbital
methods. Since the SCC-DFTB method is parameterized against the B3LYP method, it is
interesting to compare them. If we take B3LYP/MG3S barrier heights as the standards, the
MUE of SCC-DFTB is improved to 8.37 kcal/mol for UA reactions, and the MSE relative
to B3LYP/MG3S is 2.35 kcal/mol. We conclude that SCC-DFTB might be improved if it is
reparameterized against more accurate functionals.

Table 4 summarizes the best methods when methods are judged solely on the basis of the
accuracy of the predicted barrier heights, the cost of the method, and its scaling. Any method
not in Table 4 performs less well for DBH24 than at least one method in Table 4 that has
the same or lower cost or better scaling. This table, though, is not a replacement for careful
consideration when choosing a basis set. For example, BB1K is slightly better than MPWB1K and
M05-2X, on average, for barrier heights, but MPWB1K and M05-2X are
better as all-purpose functionals, especially for noncovalent interactions.52,97 Therefore Table 5
lists some additional recommended methods with performance for barrier heights almost as
good as those in Table 4 and with good overall quality on a performance-for-a-given-cost basis.
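
The selection rule behind Table 4 resembles a Pareto filter: a method is kept only if no other method is both at least as accurate and at least as cheap. The sketch below illustrates that idea under stated assumptions. It uses the NHTBH38/04 MMUE and cost values of eight N^7 methods from Table 1 purely as sample input (Table 4 itself is based on DBH24), it ignores scaling because all eight methods share the same exponent, and the function names are hypothetical.

```python
def dominates(b, a):
    """True if b = (MMUE, cost) is no worse than a in both measures
    and strictly better in at least one."""
    return b[0] <= a[0] and b[1] <= a[1] and (b[0] < a[0] or b[1] < a[1])

def pareto_front(methods):
    """Names of methods that are not dominated by any other method."""
    return [name for name, value in methods.items()
            if not any(dominates(other_value, value)
                       for other, other_value in methods.items()
                       if other != name)]

# (MMUE in kcal/mol, relative cost), copied from Table 1.
n7_methods = {
    "G3SX": (0.60, 246),
    "CCSD(T)/aug-cc-pVTZ": (0.61, 12898),
    "G3SX(MP3)": (0.75, 175),
    "MCG3-MPW": (0.82, 113),
    "MCG3-MPWB": (0.85, 114),
    "QCISD(T)/MG3S": (0.94, 5076),
    "MCG3-TS": (0.95, 108),
    "MCG3/3": (1.01, 102),
}

front = pareto_front(n7_methods)
```

In this sample, the two single-level coupled cluster and QCISD(T) entries are dominated by G3SX, which is both cheaper and slightly more accurate, while the remaining multilevel methods trade accuracy against cost.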

5  Conclusions 

Three small but representative data sets, HATBH6, NSBH6, and UABH6, are identified for
the barrier heights of heavy-atom transfer, nucleophilic substitution, and unimolecular and
association reactions, respectively. They are representative of the full data set within 12%
(HAT), 9% (NS), and 11% (UA). We combine these data sets with a previous
small representative data set for hydrogen-transfer reactions to create a diverse representative
data set of zero-point-exclusive barrier heights called DBH24. Assessment of methods with
DBH24 shows that DFT and multilevel methods have much better performance-to-cost ratios
than single-level WFT methods. The best N^6 method is BMC-CCSD; it is almost as accurate
as the best N^7 methods, and its cost is an order of magnitude smaller. The two
best N^4 methods, PWB6K and BB1K, outperform the best N^5 method, MC3BB. The best
local DFT method is M06-L. The MG3S basis set gives the best performance for DFT methods
among the tested basis sets.
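
The labels N^7 through N^3 describe how the cost of a method grows with system size. As a rough illustration, the Python sketch below computes the factor by which the cost of a formally N^k method grows when the system size is multiplied by a given factor. It assumes that N is a measure of system size (for example, the number of atoms) and it ignores prefactors, so it says nothing about the relative costs in the tables, which were timed for one small molecule. The function name `cost_growth` is hypothetical.

```python
def cost_growth(k: int, size_factor: float) -> float:
    """Factor by which the cost of a formally N^k method grows when the
    system size N is multiplied by size_factor (prefactors ignored)."""
    return size_factor ** k


for k in (7, 6, 5, 4, 3):
    print(f"N^{k}: doubling N multiplies the cost by {cost_growth(k, 2):g}")
```

Under this idealized scaling, the gap between an N^7 and an N^4 method widens by a factor of eight each time the system size doubles.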

Table 1: Calculated errors and costs of various methods for non-hydrogen-transfer reactions in NHTBH38/04.

Methods                     HAT(12)          NS(16)          UA(10)
                             MSE     MUE     MSE     MUE     MSE     MUE  MMUE^a     cost^b
N^7 Methods
G3SX                       -0.59    0.74   -0.50    0.50   -0.16    0.56    0.60        246
CCSD(T)/aug-cc-pVTZ        -0.34    0.71   -0.71    0.71   -0.18    0.41    0.61      12898
G3SX(MP3)                  -0.47    0.76    0.13    0.88   -0.21    0.85    0.75        175
MCG3-MPW                   -0.82    1.16   -0.40    0.58   -0.22    0.73    0.82        113
MCG3-MPWB                  -0.94    1.15   -0.16    0.65   -0.26    0.76    0.85        114
QCISD(T)/MG3S               1.04    1.21   -0.62    1.08    0.30    0.53    0.94       5076
MCG3-TS                    -0.74    1.35   -0.91    0.91   -0.04    0.59    0.95        108
MCG3/3                     -0.61    1.17   -0.35    0.94   -0.48    0.91    1.01        102
MLSE4+d                    -0.44    1.04   -3.64    4.47   -0.36    0.55    2.02        186
MLSE3+d                     0.05    1.09   -3.80    4.64   -0.20    0.45    2.06        186
MLSE2+d                    -0.35    1.02   -4.37    5.65   -0.38    0.64    2.44        186
MLSE1+d                    -0.15    1.22   -4.77    5.90   -0.33    0.53    2.55        186
QCISD(T)/cc-pV(D+d)Z        2.14    3.04   -4.32    7.87    0.00    1.63    4.18        160
QCISD(T)/6-31G(d)           1.84    5.66   -3.30    7.77    0.96    2.95    5.46         63
MP4/6-31+G(d)               8.53    9.14   -0.95    2.72    3.56    5.03    5.63         84
MP4/6-31G(2df,p)            8.01    8.01   -5.03    9.13    3.74    4.18    7.11        699
MP4/6-31G(d)                8.64    9.24   -3.57    8.08    3.70    5.54    7.62         37
N^6 Methods
BMC-CCSD                   -0.27    1.14   -0.02    0.42   -0.39    0.69    0.75         27
MC-QCISD/3                  1.26    1.53   -0.15    0.53    0.44    0.61    0.89         23


Table 1: Continued.

Methods                     HAT(12)          NS(16)          UA(10)
                             MSE     MUE     MSE     MUE     MSE     MUE  MMUE^a     cost^b
MCQCISD-MPW                -1.16    1.38   -0.17    0.51   -0.11    0.85    0.91         34
MCQCISD-MPWB               -0.94    1.27    0.07    0.78   -0.09    0.85    0.97         35
MCQCISD-TS                 -1.37    1.51   -0.98    0.98   -0.02    0.70    1.06         31
MCUT-MPWB                   1.36    2.15   -0.07    0.58    1.04    1.31    1.35         34
CCSD/aug-cc-pVTZ            2.44    2.44    1.70    1.70    0.76    0.80    1.65       3868
MCUT-MPW                    1.31    2.72   -0.74    0.75    1.16    1.48    1.65         33
MCUT-TS                     0.90    2.71   -1.43    1.43    1.12    1.24    1.79         28
QCISD/MG3S                  3.43    3.43    1.26    1.32    1.04    1.08    1.94        152
CCSD/6-31+G(d,p)            4.31    4.59    1.88    1.92    2.03    2.03    2.85         12
CCSD/6-31+B(d,p)            4.53    6.48    0.68    1.97    2.19    2.19    3.55         11
MC-UT/3                     7.47    7.47    0.23    0.33    3.02    3.02    3.61         22
MP4SDQ/aug-cc-pVTZ          7.24    7.24    1.07    1.07    2.62    2.62    3.64        673
MP4SDQ/MG3S                 8.60    8.60    1.42    1.44    3.08    3.12    4.39         95
CCSD/6-31B(d)               4.49    7.09   -0.39    4.70    1.98    2.73    4.84        2.3
MP4SDQ/6-31+G(d,p)          9.55    9.55    1.18    1.86    4.05    4.05    5.15        3.4
MP3/aug-cc-pVTZ             9.24    9.24    3.09    3.09    3.69    3.69    5.34        530
MP4SDQ/6-31+B(d,p)          9.36    9.78    0.07    2.34    4.16    4.16    5.43        3.3
QCISD/6-31G(d)              3.43    5.93   -2.37    7.53    1.53    2.89    5.45        1.7
MP3/MG3S                   10.59   10.59    3.62    3.62    4.14    4.14    6.12         71
MP3/6-31+B(d,p)            11.35   11.50    2.54    2.86    5.21    5.21    6.52        2.8
MP4SDQ/cc-pV(D+d)Z          9.23    9.23   -2.44    6.95    2.81    3.79    6.66        2.8
MP3/6-31+G(d)              11.05   11.05    3.55    3.55    4.78    5.37    6.66        1.4


Table 1: Continued.

Methods                     HAT(12)          NS(16)          UA(10)
                             MSE     MUE     MSE     MUE     MSE     MUE  MMUE^a     cost^b
MP3/6-31+G(d,p)            11.49   11.49    3.66    3.66    5.08    5.08    6.74        2.7
MP4SDQ/6-31G(2df,p)         8.68    8.68   -2.73    7.73    3.86    3.98    6.80         14
MP4DQ/6-31B(d)             11.20   11.48    1.14    4.98    4.92    5.55    7.34       0.83
MP4SDQ/6-31G(d)             9.07    9.34   -1.94    7.58    3.86    5.23    7.38       0.87
MP3/6-31G(2df,p)           10.64   10.64   -0.81    6.90    4.91    4.91    7.48         12
MP3/6-31G(d)               10.97   10.97    0.02    6.93    4.89    5.81    7.90       0.73
N^5 Methods
MCCO-MPWB                   0.60    1.86    1.29    1.60    0.95    1.14    1.53         33
MC3BB                       1.18    2.44    0.55    0.66    1.53    1.53    1.54         14
MC3MPWB                     1.23    2.55    0.18    0.67    1.57    1.57    1.60         14
MCCO-TS                     0.78    2.75   -0.92    1.32    1.34    1.52    1.86         27
MC3MPW                      1.50    2.69    0.17    0.89    2.14    2.20    1.93         13
MC3TS                       1.35    2.82   -0.90    1.05    2.09    2.16    2.01        7.4
MCCO-MPW                    2.13    2.97    0.63    1.48    1.75    1.85    2.10         32
MP2/aug-cc-pVTZ            10.46   10.46    0.37    0.67    4.29    4.88    5.34        140
MC-CO/3                    10.61   10.61    0.08    1.37    4.04    4.04    5.34         21
MP2/aug-cc-pV(D+d)Z        10.28   10.28   -0.95    1.27    3.80    4.74    5.43        4.3
MP2/6-31++G(2df,2pd)       11.07   11.07    0.55    0.65    4.64    5.15    5.62         13
MP2/6-311++G(2df,2pd)      11.07   11.07    0.55    0.65    4.64    5.15    5.62         18
MP2/MG3                    11.46   11.46    0.70    0.73    4.65    5.40    5.86         14
MP2(full)/6-31G(2df,p)     11.41   11.41    0.91    0.91    4.72    5.46    5.93        3.6
MP2/MG3S                   11.76   11.76    0.74    0.74    4.71    5.44    5.98         13


Table 1: Continued.

Methods                     HAT(12)          NS(16)          UA(10)
                             MSE     MUE     MSE     MUE     MSE     MUE  MMUE^a     cost^b
MP2/6-31++G(d,p)           11.19   11.19    1.12    2.18    5.28    5.87    6.41        1.1
MP2/6-31+G(d,2p)           12.04   12.04    0.97    2.21    4.96    5.31    6.52        1.5
SAC/3^c                    11.55   11.55    0.26    2.84    5.27    5.81    6.73        2.5
MP2/6-31+B(d,p)            12.15   12.15   -0.22    2.62    5.57    5.76    6.84        1.0
MP2/6-31+G(d,p)            12.56   12.56    1.14    2.21    5.49    5.93    6.90        1.0
SAC-MP2/6-31+G(d,p)        12.06   12.06    0.32    2.95    5.93    6.44    7.15        1.7
MP2/cc-pV(T+d)Z            11.75   11.75   -2.14    4.42    4.49    5.30    7.16         15
MP2/6-311++G(d,p)          12.57   12.57    3.28    3.28    4.71    5.89    7.25        2.0
MP2/6-311+G(d,p)           12.57   12.57    3.28    3.28    4.71    5.89    7.25        1.7
MP2/6-31+G(d)              12.14   12.54    1.08    2.32    5.31    7.07    7.31       0.60
MP2/6-31B(d)               12.00   12.76   -1.45    4.99    5.57    7.10    8.28       0.41
MP2/cc-pV(D+d)Z            12.09   12.09   -2.75    7.29    4.23    5.95    8.44        1.4
MP2/cc-pVDZ                12.20   12.20   -2.97    7.45    4.23    5.95    8.53       0.92
MP2/6-31G(2df,p)           11.63   11.63   -3.47    8.28    5.34    6.23    8.71        3.0
MP2/6-31G(d)               12.00   12.43   -2.22    8.02    5.37    7.54    9.33       0.41
SAC-MP2/6-31G(d)           11.51   13.32   -3.64    8.85    6.11    8.69   10.29       0.56
N^4 Methods
BMK/MG3S                   -1.21    1.49    0.75    0.91    0.80    1.58    1.33         13
PWB6K/MG3S                 -0.24    1.61    0.94    1.10    0.65    1.53    1.41         12
BB1K/MG3S                  -0.69    1.58    1.23    1.30    0.53    1.44    1.44         12
MPWB1K/MG3S                -0.77    1.69    1.08    1.19    0.52    1.61    1.50         12
PWB6K/aug-pc2              -0.68    1.59    1.48    1.48    0.53    1.51    1.53         64
MPWB1K/aug-pc2             -1.19    1.81    1.66    1.66    0.40    1.58    1.68         64

Table 1: Continued.

Methods                     HAT(12)          NS(16)          UA(10)
                             MSE     MUE     MSE     MUE     MSE     MUE  MMUE^a     cost^b
BB1K/6-31+G(d,p)           -0.68    1.86    1.02    1.17    0.84    2.10    1.71        2.0
M05-2X/MG3S                 1.15    2.00   -0.79    1.48    0.91    1.77    1.75         13
MPW1K/MG3S                 -0.83    1.89    1.12    1.28    0.96    2.42    1.86         12
MPWB1K/aug-pc1             -1.78    2.92    0.20    1.15    0.02    1.95    2.01        3.7
M05-2X/6-31+G(d,p)          1.06    2.53   -0.63    1.68    1.00    1.90    2.04        2.2
BHandHLYP/MG3S              0.07    3.04    0.95    1.39    0.76    1.98    2.14         10
M05/MG3S                   -2.84    3.79    0.00    0.80    0.69    2.24    2.28         13
B97-2/MG3S                 -3.13    3.52   -1.43    1.47    0.62    1.91    2.30         11
B1B95/MG3S                 -4.73    4.73   -0.95    1.08   -0.58    1.21    2.34         12
MPW1B95/MG3S               -4.62    4.62   -0.81    1.21   -0.52    1.31    2.38         12
MPWKCIS1K/MG3S             -1.94    2.82    1.69    1.69    0.95    2.97    2.49         13
M05/6-31+G(d,p)            -2.73    4.29   -0.74    1.14    0.81    3.06    2.83        2.1
PW6B95/MG3S                -5.36    5.36   -2.05    2.05   -0.76    1.43    2.95         12
mPW1PW/MG3S                -5.99    5.99   -1.81    1.94   -0.38    2.00    3.31         11
B98/6-311+G(3df,2p)        -5.19    5.19   -2.93    2.93   -0.39    1.96    3.36         10
B98/MG3S                   -5.18    5.18   -2.96    2.96   -0.31    1.97    3.37         11
B97-1/MG3S                 -5.18    5.18   -3.21    3.21   -0.23    1.83    3.41         11
PBE1PBE/MG3S               -6.62    6.62   -1.87    2.05   -0.58    2.16    3.61         10
PBE1KCIS/MG3               -8.56    8.56   -1.77    1.88   -0.86    2.64    4.36         12
X3LYP/MG3S                 -8.48    8.48   -2.89    2.90   -1.43    2.06    4.48         11
B3LYP/MG3S                 -8.49    8.49   -3.25    3.25   -1.42    2.02    4.59        9.4
τHCTHh/MG3S                -6.73    6.73   -4.53    4.53   -0.23    2.65    4.64         11
MPWB1K/cc-pVDZ             -2.06    3.14   -5.79    9.40    0.01    1.74    4.76        1.6

Table 1: Continued.

Methods                     HAT(12)          NS(16)          UA(10)
                             MSE     MUE     MSE     MUE     MSE     MUE  MMUE^a     cost^b
O3LYP/MG3S                 -8.27    8.27    2.61    4.42   -1.02    2.27    4.99         11
B3LYP/6-31+G(d,p)          -8.79    8.79   -3.59    3.59   -1.27    2.65    5.01        1.4
MPW3LYP/MG3S               -9.29    9.29   -4.29    4.29   -1.61    2.21    5.26         11
MPW1KCIS/MG3S              -9.89    9.89   -3.46    3.46   -1.27    2.61    5.32         13
TPSS1KCIS/MG3S             -9.26    9.26   -4.88    4.88   -1.39    2.12    5.42         13
B3LYP*/MG3T                -6.02    6.02   -6.92    8.40    1.45    4.00    6.14        7.5
B97-2/6-31G(d)             -4.25    5.85   -7.33   11.03    0.95    2.16    6.35       0.89
TPSSh/MG3S                -11.51   11.51   -5.78    5.78   -2.94    3.23    6.84         13
B3LYP/6-311G(2df,2p)       -8.92    8.92   -8.97   11.05   -1.71    1.98    7.32        5.2
B3LYP/6-311G(2d,p)         -8.89    8.89   -9.68   11.65   -1.85    1.98    7.51        1.5
BB1K/MIDIY                 -6.80    6.90  -11.71   16.40    0.03    1.83    8.38        1.4
HF/aug-cc-pV(D+d)Z         14.23   16.34    5.23    5.23    2.35    3.77    8.45       0.55
B3LYP/6-31G(d,p)           -9.99    9.99   -9.71   12.92   -1.51    2.49    8.47       0.92
HF/cc-pV(T+d)Z             15.03   16.97    4.18    5.04    2.49    3.75    8.59        1.9
HF/cc-pV(D+d)Z             14.19   16.50    1.20    5.64    2.22    3.80    8.65       0.45
HF/6-31+B(d,p)             14.55   17.73    4.15    4.34    3.23    3.99    8.69       0.87
HF/cc-pVDZ                 14.25   16.57    0.93    5.80    2.22    3.80    8.72       0.28
HF/6-31G(d)                13.63   15.47    2.52    6.74    2.95    4.23    8.81       0.15
HF/aug-cc-pVTZ             14.43   16.39    6.76    6.76    2.53    3.72    8.96         45
HF/6-31B(d)                13.93   17.34    3.22    5.16    3.43    4.40    8.97       0.17
HF/6-31+G(d,2p)            15.33   17.44    5.66    5.66    2.86    3.91    9.00       0.97
HF/6-31+G(d,p)             15.29   17.38    5.72    5.72    3.03    3.95    9.02       0.67
HF/MG3                     14.64   16.67    6.67    6.67    2.67    3.79    9.04        9.2

Table 1: Continued.

Methods                     HAT(12)          NS(16)          UA(10)
                             MSE     MUE     MSE     MUE     MSE     MUE  MMUE^a     cost^b
HF/G3XLarge                14.62   16.64    6.73    6.73    2.67    3.79    9.05         18
HF/G3Large                 14.64   16.66    6.72    6.72    2.67    3.79    9.06         13
HF/MG3S                    14.86   16.87    6.67    6.67    2.70    3.82    9.12        7.9
N^3 Methods
M06-L/MG3S                 -5.68    5.93   -3.56    3.56    0.27    2.35    3.95  5.7/8.9^d
M06-L/6-31+G(d,p)          -5.36    6.22   -4.55    4.55    0.16    2.27    4.35    2.1/2.2
HCTH/MG3S                  -8.84    8.84   -2.71    2.71   -0.75    2.20    4.58    3.9/6.7
VSXC/MG3S                  -7.44    7.44   -5.30    5.30   -0.91    2.40    5.05    5.2/8.8
VSXC/6-31+G(d,p)           -7.58    7.58   -5.52    5.52   -0.67    2.90    5.33    1.9/2.0
OLYP/MG3S                 -11.23   11.23   -2.73    2.73   -1.92    2.53    5.50    3.7/6.8
τHCTH/MG3S                 -9.21    9.21   -5.71    5.71   -1.04    2.82    5.91    5.4/8.8
G96LYP/MG3S               -13.03   13.03   -5.80    5.80   -2.86    3.04    7.29    3.8/7.3
mPWKCIS/MG3S              -13.65   13.65   -6.66    6.66   -2.67    3.07    7.79    6.1/9.7
BB95/MG3S                 -13.88   13.88   -6.36    6.36   -3.22    3.40    7.88    5.4/8.7
TPSSKCIS/MG3S             -13.37   13.37   -7.64    7.64   -2.56    2.98    8.00    5.7/9.3
MPWPW/MG3S                -14.10   14.10   -7.45    7.45   -2.67    3.10    8.22    3.8/7.1
PBE1W/MG3S                -14.51   14.51   -7.06    7.06   -2.69    3.13    8.23    3.8/7.4
PBE/MG3S                  -14.93   14.93   -6.97    6.97   -2.94    3.35    8.42    3.7/7.3
BP86/MG3S                 -15.51   15.51   -6.91    6.91   -3.41    3.87    8.76    3.9/6.9
PW91/MG3S                 -15.42   15.42   -7.75    7.75   -2.78    3.22    8.80    3.8/6.9
TPSS/MG3S                 -14.65   14.65   -7.75    7.75   -3.84    4.04    8.81    5.5/8.8
BLYP/MG3S                 -14.66   14.66   -8.40    8.40   -3.38    3.51    8.86    3.8/7.3
mPWLYP/MG3S               -15.76   15.76   -8.14    8.14   -3.64    3.79    9.23    3.8/7.3

Table 1: Continued.

Methods                     HAT(12)          NS(16)          UA(10)
                             MSE     MUE     MSE     MUE     MSE     MUE  MMUE^a     cost^b
SPL/MG3S                  -23.36   23.36   -8.58    8.58   -5.08    5.82   12.59    2.5/5.8
SPWL/MG3S                 -23.48   23.48   -8.50    8.50   -5.17    5.90   12.63    3.5/5.9
PM6                       -18.83   19.37    1.76    6.54    9.61   15.66   13.86    5x10^-5
AM1                        -8.99   10.60   16.02   19.08    9.04   15.01   14.90    5x10^-5
PM3                       -13.97   15.88   19.91   20.24    3.53   11.25   15.79    5x10^-5
RM1                       -14.80   18.34    6.59   18.43    6.72   14.28   17.02    5x10^-5
PDDG/PM3                  -21.19   21.90   20.53   20.53    3.01   10.52   17.65    5x10^-5
SCC-DFTB                -23.02^e 23.02^e      --      --   -3.47    9.94 16.48^f    4x10^-4

a MMUE is defined in Eq. 4.
b The cost for each method is measured by the computer time for an energy gradient calculation of
phosphinomethanol divided by the computer time for an MP2/6-31+G(d,p) energy gradient calculation with the same
software (except semiempirical molecular orbital methods, see text) on the same computer. Although such
costs depend to some extent (for example, 15%) on the machine, the program, and the computer load, they still
provide a useful indication of computer resource demand.
c Also called SAC-MP2/6-31+G(d,2p).
d The double entries for local DFT methods correspond to timings with/without density fitting.
e The forward and reverse barriers of the reaction H + N2O → OH + N2.
f Mean value of the MUE for the forward and reverse barriers of the reaction H + N2O → OH + N2 and the
MUE for UA(10).
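
The error columns of Table 1 can be checked with a few lines of Python. The sketch below defines the mean signed error, the mean unsigned error, and the mean of per-class MUEs. It assumes that the MMUE of Eq. 4 is the unweighted mean of the HAT, NS, and UA MUEs, which is consistent with the tabulated G3SX row, and it applies footnote f for SCC-DFTB. The function names are illustrative, not taken from the paper.

```python
def mse(errors):
    """Mean signed error: average of (calculated - reference) deviations."""
    return sum(errors) / len(errors)


def mue(errors):
    """Mean unsigned error: average of the absolute deviations."""
    return sum(abs(e) for e in errors) / len(errors)


def mmue(mues):
    """Mean of per-class MUEs (assumed to be an unweighted mean)."""
    return sum(mues) / len(mues)


# MUE(HAT), MUE(NS), MUE(UA) for G3SX, from Table 1
g3sx_mmue = mmue([0.74, 0.50, 0.56])
# SCC-DFTB, footnote f: MUE for the H + N2O barriers and MUE for UA(10)
scc_dftb_mmue = mmue([23.02, 9.94])

print(round(g3sx_mmue, 2), round(scc_dftb_mmue, 2))
```

The two calls reproduce the tabulated MMUE values of 0.60 kcal/mol for G3SX and 16.48 kcal/mol for SCC-DFTB.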

Table 2: Representative barrier heights database DBH24.

Database  Reaction                       V_f^a    V_r^a
HATBH6    H + N2O → OH + N2              18.14    83.22
          H + ClH → HCl + H              18.00    18.00
          CH3 + FCl → CH3F + Cl           7.43    61.01
NSBH6     Cl-···CH3Cl → ClCH3···Cl-      13.61    13.61
          F-···CH3Cl → FCH3···Cl-         2.89    29.62
          OH- + CH3F → HOCH3 + F-        -2.78    17.33
UABH6     H + N2 → HN2                   14.69    10.72
          H + C2H4 → CH3CH2               1.72    41.75
          HCN → HNC                      48.16    33.11
HTBH6     OH + CH4 → CH3 + H2O             6.7     19.6
          H + OH → O + H2                 10.7     13.1
          H + H2S → H2 + HS                3.6     17.3

a V_f denotes the forward barrier height (BH), and V_r denotes the reverse BH (in kcal/mol).
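
Because the forward and reverse barriers of a reaction share the same saddle point, their difference gives the zero-point-exclusive energy of reaction, ΔE = V_f − V_r. The short Python sketch below applies this relation to three Table 2 entries. The dictionary and the function name `reaction_energy` are illustrative and do not come from the paper.

```python
# Forward and reverse barrier heights (kcal/mol) for three Table 2 reactions
barriers = {
    "H + N2O -> OH + N2": (18.14, 83.22),
    "H + ClH -> HCl + H": (18.00, 18.00),
    "HCN -> HNC": (48.16, 33.11),
}


def reaction_energy(v_forward: float, v_reverse: float) -> float:
    """Energy of reaction implied by a forward/reverse barrier pair."""
    return v_forward - v_reverse


for reaction, (vf, vr) in barriers.items():
    print(f"{reaction}: dE = {reaction_energy(vf, vr):+.2f} kcal/mol")
```

The symmetric exchange H + ClH → HCl + H has equal forward and reverse barriers, so its reaction energy is zero, as expected.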

Table 3: Calculated mean errors of various methods for non-hydrogen-transfer and hydrogen-transfer representative databases

Table 4: Best methods for barrier height calculations as judged by performance for a given cost
and scaling.

Method                   Scaling      Cost  MMUE/DBH24 (kcal/mol)
G3SX                           7       246        0.64
BMC-CCSD                       6        27        0.72
PWB6K/MG3S                     4        12        1.21
BB1K/6-31+G(d,p)               4       2.0        1.70
MPW1K/6-31+G(d,p)              4       1.4        1.92
MPW1K/MIDIX+                   4       1.0        3.10
MPW1K/6-31B(d)                 4      0.80        3.66
MPW1K/MIDI!                    4      0.66        7.97
HF/MIDI!                       4      0.10        8.98
M06-L/MG3S                     3       5.7        4.34
M06-L/6-31+G(d,p)              3       2.1        4.55
M06-L/6-31G(d)                 3       1.6        6.47
BLYP/6-31+G(d,p)               3       1.3        8.25
PM3                            3   5x10^-5       12.85


Table 5: Other methods to be considered for reaction dynamics calculations based on overall
quality.

Method                   Scaling      Cost  MMUE/DBH24 (kcal/mol)
G3SX(MP3)                      7       175        0.71
MCG3-MPW                       7       113        0.79
MCQCISD-MPWB                   6        35        0.95
MC3MPWB                        5        14        1.37
M05-2X/MG3S                    4        13        1.61
M05-2X/6-31+G(d,p)             4       2.1        1.92
PWB6K/6-31+G(d,p)              4       2.0        1.73
AM1                            3   5x10^-5       12.88